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(57) Abstract 



A method of producing a hybrid DNA molecule allowing the assembly of sequences xi, X2 Xn where n is greater than or equal to 

3 (e.g. give sequences) from diverse locations into a hybrid molecule for the purpose of mutation analysis. The method comprising the 

steps of: (1) providing in a single reaction mixture: (a) the sequences xi, X2 Xn and their complementary sequences x\\ X2* Xn', to 

be assembled into the hybrid molecule; (b) for each pair of complementary sequences defined in (a) a respective pair of PCR primers each 
having a priming sequence and which are such that the primers hybridising to the 3* ends of any two sequences (xj, x*(i+i)), where i is 1 
to (n-1), have specifically complementary linker sequences; (2) effecting a first stage PCR reaction in which those primers provided with 
linker sequences are present in limiting concentrations; and (3) effecting a second stage PCR reaction using a single pair of primers one 
of which provides the 5 '-end of the sense strand and other of which provides the 3 '-end of the anti-sense strand of the required hybrid 
molecule; whereby said hybrid molecule is generated. 
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NUCLEIC ACIDS 

The present invention relates to a method of producing a DNA hybrid 
molecule comprised of a contiguous linear array of three or more sequences of interest 
(or potential interest) in predetermined relationship to each other. 

There are a number of applications in which it would be desirable to provide a 
DNA hybrid molecule comprising sequences of (potential) interest in predetermined 
relationship to each other. One such application is in mutation detection where many 
mutation scanning teclmiques which provide positional information are capable of 
scanning up to 1.5kb at a time e.g. COM, EMC, NIRCA, PTT. However, most genes 
are organised into short coding sequences (exons) which are dispersed over 
comparatively large distances. As a result, mutation scanning techniques are not used 
to their full potential and consequently cost vs benefit considerations have limited 
their usefulness and favoured less sensitive, size constrained techniques like SSCP 
and HA. Other altemative methods of circumventing these problems such as RT-PCR 
are fraught with problems of working with RNA and mutant allele loss in autosomal 
conditions. 

A further application in which it may be desirable to provide a DNA hybrid 
molecule is DNA sequencing. 

Various techniques are already known for producing DNA molecules 
comprising exons present at diverse locations in the same or different DNA 
molecule(s). One such technique is Gene "splicing" by overlap extension (SOE) as 
disclosed in US-A-5 023 171 (see also Horton et al, 1989. Gene, 77, 61-68). In this 
technique, two gene fragments to be "spliced" together are initially amplified in 
separate PGR reactions. For each PGR reaction, one of the two primers has a "linker" 
sequence which is attached to the 5' end of the genomic specific sequence .of the 
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primer and which is designed to be complementary to the linker sequence on a primer 
of the other PGR reaction. 

The amplified gene fragments from tlie separate PGR reactions are then 
purified, combined together and subjected to cycles of denaturing, reannealing and 
strand synthesis. As a result of these cycles, the complementary linker sequences 
hybridise to each other and are extended to produce a double stranded molecules 
which comprise the two gene fragments connected by the linker sequences. 

The Gene splicing by overlap extension method requires fragments to b joined 
in a pairwise fashion thus including additional laborious steps of purification and re- 
amplification for the creation of constructs of more than two fragments. Furthermore, 
although the Gene Fusion method utilises Taq polymerase no account is taken in the 
primer design of the 3' dA overhangs introduced by Taq. Consequently with the 
primer design employed in US-A-5 023 171, the majority of fragments will have 3* 
mismatches leading to inefficient fusion of the two fragments. 

A further technique for producing DNA hybride molecules is Gene Fusion as 
disclosed in Nucleic Acids Research, 17, 4895, (1989) (Yon & Freide) where a single 
linking primer is used to fiise the fragments. This is unlikely to be at all efficient for 
the fusion of more than two fragments since the PGR products themselves act as 
substitutes for primers for the whole of the fusion reaction leading to inefficient 
intermediate formation. Also with this published technique with only two fragments 
there is contamination by unfused products within the reaction. 

A further technique for producing DNA hybrid molecules is disclosed in WO- 
A-92 15678 (Stratagene) which describes a PCR-based process for generating a library 
of dicistronic DNA molecules (comprising upstream and downstream cistrons) for 
producing antibodies. The disclosure contemplates combining, in a single reaction 
vessel. 
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(i) a repertoire of first polypeptide genes with a PGR primer pair therefor, one of 
the primers having a first 5'-terminal non-priming portion providing a linking 
sequence; and 

(ii) a repertoire of second polypeptide genes v^ith a PGR primer pair therefor, one 
of the primers having a second 5*-terminal non-priming portion. 

The first and second 5'-terminal portions are such that they are capable of 
hybridising to form a duplex encoding a double-stranded cistronic bridge for linking 
the upstream and downstream cistrons. 

Under the conditions of denaturation, reannealing and strand synthesis 
employed in the reaction, the library of dicistronic molecules is generated. 

According to a first aspect of the present invention there is provided a method 
of producing a hybrid DNA molecule having a sense strand and an anti-sense strand 
and in which, reading in the 5' to 3' direction, the sense strand has the sequences x,, 
X2 x„, where n is greater than or equal to 3, the method comprising the steps of 

(i) providing in a single reaction mixture 

(a) the sequences x,, Xj x„ and their complementary sequences 

x/, X2* x„', to be assembled into the hybrid molecule, 

(b) for each pair of complementary sequences defined in (a) a 
respective pair of PGR primers each having a priming sequence 
and which are such that the primers hybridising to the 3' ends of 
any two sequences (Xj, x\i^,)), where i is 1 to (n-1), have 
specifically complementary linker sequences 
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(2) 
(3) 



whereby said hybrid molecule is generated. 

In a preferred embodiment of the invention, each of the first and second stage 
PGR reactions employs Taq as a thermostable polymerising enzyme. In this case, 
those primers having linker sequences are designed such that their linker sequence is 
connected to their respective priming sequence via an adenosine residue. This takes 
account of the 3' adenine overhang added by Taq at the 3' end of an extended strand. 
Primers incorporating the extra adenosine residue may of course be used both in 
conjunction with any other polymerising enzyme which adds a 3*-adenosine overhang 
at the end of an extended strand and those which do not. 

This constitutes an important feature of the present invention according to a 
second aspect of which there is therefore provided a method of producing a hybrid 
DNA molecule having a sense strand and an anti-sense strand and in which, reading in 

the 5' to 3* direction, the sense strand has the sequences x,, Xj x„, where n is greater 

than or equal to 3, the method comprising the steps of 

(i) providing in a single reaction mixture 

(a) the sequences x,, X2 x„ and their complementary sequences 

x/, X2* Xn', to be assembled into the hybrid molecule, 
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effecting a first stage PGR reaction in which those primers provided 
with linker sequences are present in limiting concentrations, and 

effecting a second stage PGR reaction using a single pair of primers 
one of which provides the 5'-end of the sense strand and other of which 
provides the 3'-end of the anti-sense strand of the required hybrid 
molecule 
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(b) for each pair of complementary sequences defined in (a) a 
respective pair of PGR primers each having a priming sequence 
and which are such that the primers hybridising to the 3' ends of 
any two sequences (x^, x*(i+,)), where i is 1 to (n-1), have 
specifically complementary linker sequences connected to their 
respective priming sequences via an adenine residue, and 

(2) effecting a PGR reaction using a thermostable polymerase which may 
or may not add a 3' adenine overhang to the end of an extended strand. 



The invention enables a DNA hybrid molecule comprised of the sequences x,, 

Xj x„ (and their respective complements x/, X2' x^) in predetermined linked 

relationship to be produced in high yield from the respective individual sequences 
provided at diverse regions of the same or different DNA molecules. The linker 
sequences which are used are independent of the sequences which are to be Eissembled 
into the intended hybrid molecule. Furthermore the incorporation of the additional 
adenosine residue for primers to be used in conjunction with Taq as the polymerising 
enzyme avoids 3' mismatches leading to inefficient fusion of fragments. 

The sequences x, Xj x^ may for example be exons which are separated by 

any distance (possibly of unknown sequence) along a particular DNA molecule 
whereas in the hybrid molecule produced by the method of the invention the 
sequences are spaced from each other by a relatively short, and known, sequence of, 
say, 20 to 30 bases. 

Hybrid molecules produced in accordance with the method of the invention 
may for example be constructed for the purposes of mutation detection. More 
p£irticularly, the hybrid molecule may comprise a plurality of exons (in which ojie or 
more mutations may be present) in a molecule having a length of up to 1.5 kb thus 
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permitting mutation scanning of all of these exons using a technique such as CCM, 
EMC, NIRCA or PTT. A further possibility is a hybrid molecule comprised of a 
plurality of exons which may be constructed to enable more efficient sequencing of 
exons (as compared to their sequencing the exons at their locations in the endogenous 
DNA molecule). 

A further possibility is that the method of the invention is used for the 
construction of hybrid genes. 

The method of the invention involves two PGR reaction steps which allow the 
desired DNA hybrid molecule to be produced from the individual sequences x,, 

X2 Xn (and their complements Xj xj which are provided in a single reaction 

mixture. It will however be appreciated that, in certain circumstances, only one or 

other of the sets of complementary the sequences (x, Xj x„) or (x/ X2' x„*) 

may initially be present at the start of the first PGR reaction, in which case the 
"missing" complementary sequences are generated in situ in the initial PGR reaction 
stage. 

For the first stage of PGR reaction, there is a pair of PGR primers for each set 

of complementary sequences (x,, x/), (Xj, Xj') (x„, x„*) which are to be present in the 

intended DNA hybrid molecule. The primers hybridising to the 3' ends of the 
sequences x/ and x„ may be "standard" PGR primers and may hybridise either to the 
sequences x,* or x,, (as the case may be) or to a region external thereto (since the 3' 
ends of the sequences x/ and x^ to be incorporated in the hybrid molecule are 
determined by the primers used in the second stage of the PGR reaction (see below)). 

The other primers for the first stage of PGR reaction are in effect modified 
PGR primers and comprise a priming sequence (i.e. a sequence which will hybridise 
to the appropriate x or x' sequence in the manner of a standard PGR primer) attached 
to a linker sequence. (For convenience, primers incorporating linker sequences are 
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also referred to herein as "linker primers"). The linker sequences are such that the 
primers which hybridise to the 3* ends of any two sequences (Xj, X(i+,))where i is 1 to 
(n-1) have linker sequences which are specifically complementary to each other, i.e. 
these linker sequences will hybridise to each other but not to any other sequences in 
the reaction mixture. The manner in which the linker sequences function to provide 
for assembly of the required DNA hybrid molecule will be more fully appreciated 
from the description given below in relation to the drawing. 

Adopting this type of primer design, it is possible to employ linker sequences 
which are independent of the x and x, sequences to be incorporated in the desired 
hybrid molecule so that it is not necessary to ensure that the linker will hybridise to a 
part of an X or x* sequence to be linked into the hybrid molecule. This design of the 
linkers is thus not constrained. 

The linker sequences may for example be comprised of 20 to 30 bases and are 
ideally such that they do not have any secondary structure (e.g. "hairpins"). It is 
preferred that the annealing temperature (Tm) of the complementary pairs are 
substantially the same and is 2-5°C higher than the annealing temperature of the 
primers to the x and x* sequences. 

Purely by way of example, there is set out below complementary sets of 
complementary linker sequences which may be incorporated in linker primers for the 
assembly (using PGR reactions employing Taq as the polymerising enzyme) of a 
hybrid DNA molecule comprised of five sequences x,.5 and their respective 
complements. In the following, the term "specific sequence" is the priming portion of 
the linker primer. 

5'tcatattagccgctgcattgcc-a-specific seq 3' 
5'ggcaatgcagcggctaatatga-a-specific seq 3' 
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5'agccactacccaaactcctgt-a-specific seq 3' 
5'acaggagtttgggtagtggct-a-specific seq 3' 

5'tgtctcactgaacctgcctacct— a-specific seq 3* 
5'aggtaggcaggttcagtgagaca-a-specific seq 3' 

5'cctcattaccggctgtcagactg-a-specific seq 3' 
5'cagtctgacagccggtaatgagg-a-specific seq 3' 

In the above sequences, the "a" represented in bold is an additional adenosine 
residue incorporated in the primer to take account of the 3' adenosine overhang added 
by Taq at the end of an extended strand. 

In the first stage PGR reaction, the linker primers, and preferably also the 
standard PGR primers for the 3' ends of the sequences x/ and x„, £ire provided in a 
limiting concentration. A "limiting concentration" is a concentration of primers in a 
PGR reaction leading to inefficient amplification and such that an increase in 
concentration results in an increase in product yield. The ratio of primers to single 
copy template in the limiting PGR reaction may typically be about 1 x 10^:1 to about 1 
x 10^:1 (e.g. about 3 x 10^:1). A standard limiting concentration may for example be 
40nM. The limiting concentration may however readily be determined empirically for 
each set of primers used. 

Subject to the use of limiting primer concentrations as outlined above, the first 
stage PGR reaction may be conducted under the conditions (e.g. polymerase enzyme, 
nucleotides, buffers, temperature cycling etc.) will be understood by those skilled in 
the art. For example, the temperature cycling may involve stages of denaturation at 
90"G, hybridisation at eO'^G, and strand synthesis at 72°C. 
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It is particularly preferred that the polymerase enzyme which is used for the 
first stage reaction is one formulated so as to be activated by heat (e.g. at 90*^ to 95°C) 
so that there are no non-specific hybridisations being extended at low temperature. 

Heat activated polymerase enzymes are know to those skilled in the art and 
examples of such enzymes which may be employed in the method of the invention 
include AmpliTaq Gold (ex Perkin Elmer) and Platinum Taq (ex Gibco BRL). 

In a preferred embodiment of the method of the invention, the product of the 
first stage PGR reaction is treated, prior to the second stage, to prevent the extension 
of unwanted non-stringent hybridisations between residual primers and the template 
DNA. This ensures a proper "Hot Start" for the second stage PGR reaction. This 
treatment may for example comprise an exonuclease digestion involving addition of 
exonuclease to the first stage reaction mixture followed by incubation of the mixture 
at ^TC (e.g. for 15 min) and then at 80°C (e.g. for 30 min) so as to remove any single 
stranded molecules. Alternatively, and more preferably, the first stage reaction 
mixture is cooled, e.g. at -20*'C, to inactivate residual DNA polymerase activity. 

In the second stage reaction there is employed an excess of two PGR primers 
one of which hybridises to the 3' end of sequence x,' and the other of which hybridises 
to the 3' end of the sequence x„ (and which therefore respectively provide the 5' ends 
of the sense and anti-sense strands of the intended hybrid molecule). The excess is a 
concentration of primers such that an increase in concentration does not produce an 
increase in yield. The "excess" may readily be determined by a person skilled in the 
art and for a typical reaction might for example be 500nM. 

Fresh polymerising enzyme is used for the second stage PGR reaction and this 
is preferably a heat activated polymerase enzyme as described for the first stage 
reaction. 
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The second stage reaction may be conducted under temperature cycling 
conditions as described for the first stage reaction. 

The invention is further described with reference to the accompanying 
drawings, in which: 

Fig. 1 illustrates a DNA hybrid molecule and a "naturally occurring" DNA 
molecule from which it is produced; and 

Fig, 2 illustrates steps in the conversion of the "naturally occurring" DNA 
molecule to the hybrid molecule. 

Figs. 3 to 7 illustrate the results of the Examples. 

The invention is described by way of example only with reference to the 
synthesis of a DNA hybrid molecule 1 (see Fig. 1) comprised of exons present in a 
naturally occurring DNA molecule 2 (see Fig. 1). More particularly, the molecule 2 is 
shown as being comprised of sense and anti-sense strands 3 and 4 respectively with 

the former incorporating exons Xj, Xj, X3 x^ of interest (e.g. for the purpose of 

mutation analysis). These sequences x,, Xj, X3 x„ have their respective 

complementary sequences x/, X3' x„' in the anti-sense strand 4 shown and may 

for example be separated from each other by several hundred bases. 

The hybrid molecule 1 is illustrated as having sense and anti-sense strands 5 

and 6 respectively with the former incorporating the sequences x,, Xj, X3 x^ and the 

latter incorporating sequences x/, X3' x„*. As shown, sequences x, and Xj are 

connected (reading in the 5* to 3* direction) by the sequence -t-Li2-a" where a and t 
represent adenine and thymidine residues respectively and Ljj is a linker sequence (the 
subscript "12" indicating that the linker is between sequences Xi and Xj. Similarly 
sequences Xj and X3 are connected by the sequence -t-L23-a £ind so forth. 
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Reference is now made to Fig. 2 which illustrates the manner in which 
molecule 1 is synthesised from molecule 2. More particularly, the synthesis involves 
a two stage PGR reaction employing heat activated Tag as the polymerising enzyme 
and in which the first stage utilises limiting concentrations of a plurality of pairs of 

modified PGR primers, one pair for each of sequences (x,, x/), (xj, Xj*), (X3, X3') 

(x„,x„*) to be incorporated in the hybrid molecule. For each such sequence (Xj, Xj') 
(where j is 2 to (n-1) which is to be an intemal sequence in the hybrid molecule 1 (i.e. 
all sequences except (x,, x/) and (x„, x„')) the primer pair comprises 

(i) a first primer having a priming sequence which will specifically hybridise to 
the 3' region of the "sense sequence" Xj and which is connected at its 5'-end to a linker 
sequence L*j (j+i) via an adenine residue ("a"); and 

(ii) a second primer having a priming sequence which will specifically hybridise 
to the 3' region of the "antisense sequence" X;' connected at its 5'-end to a linker 
sequence Lj.,j via an adenine residue ("a"). 

The primer pair for sequence (x„ x/) comprises a primer as defined under (i) 
above (in which the linking sequence is L,2) and a conventional PGR primer (PI) 
specific for the 3' region of sequence x/. The primer pair for sequences x„, x„' 
comprises a primer as defined under (ii) above (in which the linking sequence is L^„. 
i) n) and a conventional PGR (P2) primer specific for the 3' region of sequence x„. 

The manner in which the described primers are intended to hybridise to the 
starting DNA molecule 2 (when denatured into its component strands) is as illustrated 
in Fig. 2. 

The linker sequences which form part of the modified PGR primers are such 
that they do not have any intemal secondary structure (e.g. "hairpins") and that two 
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such complementary sequences have an annealing temperature (T^J which slightly 
exceeds that of the priming sequence and its complement. Examples of suitable linker 
sequences have been given above. 

In the first stage PGR reaction, the "starting" DNA molecule 2 is treated with 
the sets of primers as described together with a heat activated Tag polymerase (e.g. 
having an activation temperature of ca 94°C) in an appropriate buffer. In this first 
stage, all of the primers are present in limiting concentration. 

The first stage is conducted under conditions of temperature cycling such that 
there is an initial, relatively high temperature, denaturation step (e.g. at 95**C), 
followed by a hybridisation step (e.g. at 60°C) followed by a strand synthesis step 
(e.g. at 72"C). 

As a result of this series of temperature cycling steps, a PGR reaction is 
effected resulting in the generation of the "short" products (see Fig. 2) in which the Sp- 
ends of £ill X and x* sequences (except Xj and Xj,') are connected to their respective 
linker sequences via an adenine residue "a" and the 5'-ends of all sequences (except x/ 
and x„) are connected to their respective linker sequences via a thymidine residue "t". 
Furthermore, it will be noted that the 3*-end of each of the short products has an 
adenine residue as added by the Tag polymerase. 

Whilst we do not wish to be limited by any particular mechanistic 
interpretation, we believe that the first stage PGR reaction also results in a degree of 
formation of "longer" products. The manner in which such "longer" products will 
form from selected ones of the short products is illustrated in those steps of the 
reaction illustrated within the deished box. More particularly, complementary linker 
sequences are able to hybridise to each other and it will be noted that their flanking "t" 
and "a" residues respectively hybridise to the "a" and "t" residues which fleink the 
complementary linker sequence. Thus "self-primed" constructs are generated and 
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extension can occur in the direction of the arrows to produce the "longer" products. 
Moreover it will be appreciated that a terminal linking sequence of such longer 
product so generated may be able to hybridise with a terminal linking sequence of a 
further "longer" product or of a short product so that further extension is possible to 
produce fragments of greater length. (It should be noted that the generation of the 
longer products may take place partly in the first stage reaction and partly in the 
second stage reaction (discussed below) although we do not wish to preclude the 
possibility that these products are formed wholly in either the first or second stage 
reaction. It is for this reason that the generation of the "longer" products is illustrated 
within the box defined by dashed lines and the first and second stages of the PGR 
reactions are connected by dashed arrows. 

At the end of the first stage reaction, the reaction mixture is preferably frozen 
to -20°C to deactivate any residual polymerase activity. 

For the second stage of the reaction, an excess of flanking primers which 
define the 5' ends of the sense and anti-sense strands of the intended hybrid molecule 
is added to the reaction mixture. Fig. 2 illustrates these flanking primers are 
illustrated as FPl and FP 2 together with the locations at which they hybridise. Also 
provided for the second stage reaction is further polymerase which is activated only at 
elevated temperature together with buffers, nucleotides etc. The reaction mixture is 
then subjected to temperature cycling as previously i.e. denaturation (e.g. at 95°C for 1 
min), hybridisation e.g. at 60°C (e.g. for 1 min) and synthesis (e.g. at 72*'C for 2 
minutes). 

Since the polymerase is only activated at elevated temperature, it is ensured 
that any sequences which randomly hybridise during the initial heating of the reaction 
mixture become denatured before the temperature at which the polymerase is 
activated so that there are substantially no extension reactions resulting from these 
hybridisations. Put another way, the only hybridisations which occur above the 
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activation temperature of the polymerase are those which are required for generation 
of the hybrid molecule 1 as explained more fully below. 

As a result of the second stage reaction there is produced the intended hybrid 
molecule. 

It should be appreciated that a number of modifications may be made to the 
illustrated protocol. For example, at least some of the sequences x, Xj-.-.x^ may 
initially be provided on separate chromosomes. 

Furthermore, the primers p, and P2 may be external to the sequences x'| and x„ 
respectively. 

The invention is further described by the following non-limiting Example. 

Example 1 
1. 1 MATERIALS AND METHODS 
1,1,1 Primer design 

A set of primers were designed to amplify exons 6, 7, 8, 9 and 10 of the 
Neurofibromatosis type 2 (NF2) gene (see Appendix A). These five exons comprise 
approximately the middle third of the coding sequence of the gene. Primers were 
designed with the assistance of the Oligo v 4.1 program (MedProbe, Postboks 2640, 
St. Hanshaugen, N-0131, Oslo, Norway). The genomic specific segments of the 
primers were selected to have closely equivalent Tj^s (average T^ = 62.3^0 @ 
180mM salt concn.). The complementary 5* termini of the self assembling primers 
were designed with randomly selected sequence of approximately 50% GC content, 
with no internal secondary structure and to have T^^s that exceeded those of the 

genomic specific seqments in order to favour self assembly (average Tj^ = 64.9^C @ 
180mM salt concn.). A supplementary adenine residue was inserted between the 
genomic and 5' complementary segments of the self assembling primers in order to 
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accommodate the 3' adenine overhangs added by Taq polymerase to the nascent DNA 
strand. All primers were checked for homology to Alu repeat sequences using the 
BLAST analysis program available at http://www.ncbi.nlm.nih.gov/cgi- 
bin/BLAST/nph-blast. Table 1 gives the sequences and Tj^^s of the 12 primers used in 

the NF2 exon 6-10 array. 

Primers were all synthesised at O.OSiiM scale and were reverse phsise cartridge and 
HPLC purified (MWG Biotech). 

1,1.2 Amplification of self assembling arrays 

DNA was extracted from peripheral blood lymphocytes on an Applied 
Biosystems 3 80 A DNA extractor. 

PGR amplification of the self assembling DNA arrays was carried out in two 
stages. The primary reactions were carried out in 20|j.1 volumes using 50ng of 

genomic DNA, 40nmol.L-l of primers I-IO (Table 1), 750^mol.L-l of each dNTP, 
0.6U Platinum Taq polymerase (GibcoBRL) in a IxPCR buffer comprising 67mM 
Tris-HCl (pH8.3 @ 25''C), 16.6mM ammonium sulphate, 3.7mM MgCl2 and 
0.085mg.mL'^ BSA. PGR amplification was carried out on Perkin-Elmer 2400 or 
9600 thermal cyclers using the following parameters; initial denaturation 94*^G (3 
mins), 30 cycles of 940G (1 min); eO^C (1 min); 720C (2 mins) followed by a final 
synthesis of 72^G (10 mins). Immediately on completion of cycling the primary PGR 
reactions were frozen at -20^G to inactivate residual DNA polymerase activity. 
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The secondary reactions were carried out in separate 20|al volumes using 2[xL 
of the primary PGR, SOOnmoLL"! of primers 11 and 12 (Table 1), 200|imol,L-l of 
each dNTP, 0.6U Platinum Taq polymerase (GibcoBRL) in a IxPCR buffer 
comprising SOinM Tris-HCI (pH9.0 @ 25°C), 20mM ammonium sulphate and 1.5mM 
MgCl2. The thermal cycling conditions were identical to those used for the primary 
amplifications. 

1.1.3 Labelling of DNA 

Depending upon the final application the internal primers (11 & 12) were 
either unlabelled (sequencing and CCM probe DNA) or 5' biotin labelled (CCM test 
DNA). For the production of internally labelled fluorescent CCM probe DNA, 
secondary PCRs were set up with primary PCRs from normal control DNA, with the 
addition of TAMRA labelled dCTP (Perkin-Elmer) to a final concentration of 
SOOnmol.L'* of secondary PCRs. Otherwise reaction conditions were identical to 
those described previously for secondary PCRs. 

1.1.4 Cycle Sequencing 

For sequencing the remainder of the product was then electrophoresed on a 
preparative 1% agarose gel and the 1046bp band was excised and purified using a 
QiaQuick gel purification kit (Qiagen) according to the manufacturer's protocol with 
the modification that the dissolved gel slice was passed through the column three 
times during the binding stage in order to maximise recovery efficiency. The purified 
sample was then eluted off the column by the addition of 40p.L of lOmM Tris.HCl 
(pH 8.0). 

40-60ng of each sample was sequenced in both orientations using either 
primers 11 or 12 with BigDye terminator cycle sequencing kits (Perkin-Elmer, 
Applied Biosystems)). The manufacturer's protocols were followed with the exception 
that the annealing temperature for the cycle sequencing reaction was increased from 
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SO^C to 550c to reflect the high of primers 11 and 12. The sequencing reactions 

were then electrophoresed and collected on an Applied Biosy stems 377 fluorescent 
sequencer using 48cm well to read plates. 

1.1.5 Chemical Cleavage of Mismatches (CCM) 

Heteroduplexes were formed by heating equal quantities of unpurified 
biotinylated test DNA with unpurified internally TAMRA labelled normal control 
probe DNA at 94^0 for 5 mins followed by annealing at 65^C for 1 2 hours. 

The heteroduplexed DNA was purified by electrophoresing each sample 
through a 1% low gelling temperature agarose gel and cutting out the 1046bp band 
using a sterile scalpel. The volume of gel slice was estimated by weighing and the gel 
slice equilibrated for 20 mins in an equal volume of 1 x P-Agarase buffer comprising 
lOmM Bis-Tris HCI (pH6,5) and ImM EDTA. The p-Agarase buffer was then 
removed and the gel slice heated at 70*^0 for 10 mins to liquefy the agarose followed 
by cooling to 37°C. A further equal volume of 1 x P-Agarase buffer was added 
followed by lU of p-Agarase I (USB Biochemical), The gel slice was then digested 
ovemight at 37*^0. 

30[iL of each digested heteroduplex DNA sample was then combined with 10 
[ig of Dynabeads M-280 Streptavidin (Dynal), freshly washed according to the 
manufacturer's instructions and resuspended in 30faL of a binding and washing buffer 
comprising 2M NaCl, lOmM Tris.HCl (pH8.0), O.lmM EDTA and 0.4% Tween 20. 
The heteroduplexes were then left to complex with the beads for 1 hour at 42^C. The 
samples were then placed on a magnet to separate the beads and the supernatant 
carefully removed using a pipette. 



1.1,6 Hydroxylamine modification 
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The beads with bound DNA were resuspended in 20^L of a solution of 4M 
hydroxylamine, 2.3M diethylamine and incubated at 37^C for 2 hours. The samples 
were then placed on a magnet to separate the beads and the supernatant was carefully 
removed using a pipette. Piperidine cleavage of modified bases was then immediately 
carried out. 

1.1.7 Potassium permanganate modification 

The beads v^th bound DNA were resuspended in 20}iL of a solution of ImM 
KMn045 3M tetraethylammonium chloride and incubated at 25^C for 10 mins. The 
samples were then placed on a magnet to separate the beads and the supernatant was 
removed using a pipette. Piperidine cleavage of modified bases was then immediately 
carried out. 

1.1.8 Piperidine cleavage 

Prior to piperidine cleavage the beads were rinsed once v^th 50|-iL of TE 
buffer. Then the beads were resuspended in 5|aL of a IM solution of piperidine in 
deionised formamide to which Genescan 2500 Rox size standard (Perkin-Elmer) and 

dextran blue loading dye (Sigma) had been added. The samples were heated at 90^C 
for 30 mins to cleave the modified bases, snap chilled on ice and placed on a magnet 
to separate the beads from the unbound DNA now in the liquid phase. 

1.1.9 Fluorescent fragment analysis 

The supernatant was loaded onto a 24cm well to read 4% denaturing 
polyacrylamide gel on an Applied Biosystems 373XL fluorescent analyser. The gel 
was electrophoresed at 2500V, 30mA and 35 W for 12 hours and data was collected 
with filter set A. After electrophoresis each lane was tracked and size calling 
performed using the internal size standard as a reference. Test samples were then 
compared with the normal control sample for the presence of novel cleavage products 
by overlaying using Genescan software. 



wo 99/64624 



PCT/GB99/01691 



20 

1.2 Results 

1.2.1 A 2|al portion of the product of step 1.1.2 applied to a 1% agarose gel 
confirmed the presence of the expected 1 046bp fragment. 

1.2.2 PGR amplification of each individual exon from the product of step 1.1.2 
under common amplification conditions gave a single fragment of the expected size, 
confirming the integrity of the primer syntheses and the compatibility of the primer 
pairs, (see Fig 3a.) 

1.2.3. Figs 4a and 4b show^ the product of step 1.1.2 sequenced from both termini. 
Fig 5a shows sequencing in the forward orientation using primer 11. Fig 4b shows 
sequencing in the reverse orientation using primer 12. The positions of the intron 
exon boundaries, primer annealing sites and linker sequences are indicated on the 
electropherograms. The product was seen to comprise the expected exons in the 
correct orientations. No discernible degradation of sequence was observed across the 
transitions from one component to the next. 

1.2.4 The procedure of step 1 .1 .2 was repeated for a range of genomic DNA samples 
and consistently yielded fragments of the expected size. Reaction yields were 
generally high (--SOOng per 10|al) with low background (see Fig 5a). 

Example 2 

In order to test the suitability of the products obtained by the procedure of the 
invention for downstream applications like mutation scemning, products from a series 
of 7 heterozygotes for NF2 mutations spread throughout four of the five NF2 CR 
exons were directly sequenced to confirm that the genotype present in genomic DNA 
was correctly represented in the product. The 7 mutant heterozygotes were as follows 
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nt600-3C>G (exon 7), nt676-7T>G (exon 8), nl713delC (exon 8), nt784C>T (exon 8), 
nt855delT (exon 9), nt887delT (exon 10) and nt948G>T + nt949G>T (exon 10). All 7 
mutations were clearly visible in the heterozygote state. Fig 6 gives example data 
from four representative mutation heterozygotes. We have also obtained heterozygote 
sequencing data of comparable quality on products that have not been gel-purified 
(data not shown). Thus, the gel purification step may be omitted in many protocols. 
1 1 NF2 mutant heterozygotes were then retrospectively screened using a modified 
fluorescent solid-phase COM method based on that of Rowley et al, as disclosed in 
Genomics 30, 574-582. The products were internally labelled using TAMRA dCTP in 
preference to end labelling. This method helps eliminate false positives resulting from 
background cleavage. Cleavage of internally labelled products should result in two 
labelled fragments with a total molecular weight equal to the uncleaved product. 
Mismatched cytosines were modified using hydroxylamine. However, on the grounds 
of safety and convenience mismatched thymines were modified using the potassium 
permanaganate in preference to osmium tetroxide. 

The results are shovm in Fig 6 Cleavage products are clearly visible in the 
lanes containing products from the following mutation heterozygotes Lane 3 - nt600- 
3C>G (exon 7); Lane 6 - nt713delC (exon 8); Lane 8 - nt678insC (exon 8); Lane 10 - 
nt770delC (exon 8); Lane 11 - nt676-7T>G (exon 8) and Lane 12 - nt948G>T + 
nt949G>T (exon 10). The remaining samples are normal controls or from mutations 
that are expected to give mismatches unaffected by hydroxylamine modification. 

Of the 11 mutations, 10 were detectable using either hydroxylamine or 
potassium permanganate modification (see Table 2). 
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Table 2: Detection of 1 1 NF2 mutation heterozygotes by fluorescent solid-phase 
CCM. The predicted mismatched bases/pairs of bases eire given for each mutation. 
Positive cleavage after the two modifications is indicated as either Yes - clear 
cleavage, (Yes) - weak but reproducible cleavage, No - no detectable cleavage. 



Mutation (Exon) 


Mismatch 


Hydroxy lamine 


KMnO^ 


nt600-3C>G (Ex 7) 


G:G + C:C 


Yes 


No 


nt676-7T>G (Ex 8) 


T:C + A:G 


Yes 


Yes 


nt676-2A>T (Ex 8) 


T:T + A:A 


No 


No 


Nt678insC (Ex 8) 


G + C 


Yes 


(Yes) 


Nt713delC (Ex 8) 


G + C 


Yes 


No 


Nt770delC (Ex 8) 


G + C 


Yes 


No 


Nt784C>T (Ex 8) 


T:G + A:C 


(Yes) 


No 


Nt855delT (EX 9) 


A + T 


No 


Yes 


nt887delT (Ex 10) 


A + T 


No 


Yes 


Nt90 3C>T (Ex 10) 


C:A + T:G 


Yes 


- No 


nt948G>T + 
nt949G>T (Ex 10) 


T:C + A:G 


Yes 


Yes 



The mutation not detected by either condition, nt676-2A>T, is predicted to 
produce heteroduplexes detectable only by potassium permanganate modification 
(T:T/A:A mismatches). Furthermore, two other mutations producing heteroduplexes 
with mismatched thymines (nt784C>T and nt903C>T) had undetectable levels of 
cleavage after potassium permanganate modification. However, both these mutations 
also produce mismatched cytosines which were detected after hydroxylamine 
modification. All of the mutations predicted to produce heteroduplexes with 
mismatched cytosines produced visible cleavage products after hydroxylamine 
modification. 
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Example 3 

3,1 Procedure 

The procedure of steps 1.1.1 and 1.1.2 (see Example 1) was repeated to 
amplify exons 8, 9, 10, 11 and 12 of the Human mismatch repair, hMLHl gene. The 
primers are as shown in Table 3. 

The genomic specific segments of the primers were selected to have closely 

equivalent T^^s (set average = 60.8^C at a salt concentration of 180mM). The 

complementary 5* termini or linker segments of the primers were designed with 
random sequence of approximately 50% GC content, with no internal secondary 
structure and to have Tjy|S that exceeded those of the genomic specific seqments in 

order to favour self assembly (set average Tj^^ = 65.4^0, both at a salt concentration 

of 180mM). A supplementary unmatched adenine residue was inserted between the 
genomic and 5' linker segments of the self assembling primers in order to 
accommodate the 3' adenine overhangs added by Toq polymerase to the nascent DNA 
strand. All primers were checked for homology to Alu repeat sequences using the 
BLAST analysis program available at http://www.ncbi.nlm.nih.gov/cgi- 
bin/BLAST/nph-blast. 
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3.2 Results 

3.2.1 A 2[xl portion of the product of step 1.1.2 applied to a 1% agarose gel 
confirmed the presence of the expected 1 247bp fragment. 

3.2.2 PGR ampHfication of each individual exon from the product of step 1.1.2 
under common amplification conditions gave a single fragment of the expected size, 
confirming the integrity of the primer syntheses and the compatibility of the primer 
pairs, (see Fig 3b.) 

3.2.3 The procedure of step 1 . 1 .2 was repeated for a range of genomic DN A samples 
and consistently yielded fragments of the expected size. Reaction yields were 
generally high (--SOOng per lOjal) with low background (see Fig 5b). 
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Genomic sequences of NF2 exons 6-10 amplified by the self assembling array. Exonic 
sequence is indicated by capitals, intronic sequence is in lower case. The annealing 
sites of primers used in the primary PCR reaction to generate the five self assembling 
DNA fragments are marked in bold underlined sequence. The annealing sites of the 
internal primer pair used during the secondary PCR to drive the array assembly are 
marked in underlined italics. The primary and secondary primers for exon 10 reverse 
overlap, the region of overlap is marked in bold, underlined italics. 

NF2 Exon 6 (SEQ ID NO. 25) 

tctgtgtgac tactcctggt gtagctttaa aatagcttta ctgtttgtaa aatgatgcat 60 

aattataaaa gtggcaaaca ataccaaatt t-ac tt catgt gtaggttttt tattttgctc 120 

tattttttgg tagGTAATAA ATCTGTATCA GATGACTCCG GAAATGTGGG AGGAGAGAAT 180 

TACTGCTTGG TACGCAGAGC ACCGAGGCCG AGCCAGgtga ggcccattca ttgttggttt 240 

acattccttt, atggg c 256 

NF2 Exon 7 (SEQ ID NO. 26) 

gaatgcttga tttggtggcc cac ccgctct ccacccatct cac ttagctc caatgacagt 60 

gtcttccgtt ctccccacag GGATGAAGCT GAAATGGAAT ATCTGAAGAT AGCTCAGGAC 120 

CTGGAGATGT ACGGTGTGAA CTACTTTGCA ATCCGGgtgt gttgaaacct ctctgagctc 180 

cttgtgtagt aga cagagac t.gagtgaggg cca qqactgc taaaatggtt acttcttcat 240 

NF2 Exon 8 (SEQ ID NO. 27) 

tctgtggacc tgctgaactg cacatgtgac agtgtgtgcc agattctttg gaaggttgaa 60 

taaaatttt g agccfccagct ggcgcttac a gtagctgttc ttattggatc cacagAATAA 120 

AAAGGGCACA GAGCTGCTGC TTGGAGTGGA TGCCCTGGGG CTTCACATTT ATGACCCTGA 180 

GAACAGACTG ACCCCCAAGA TCTCCTTCCC GTGGAATGAA ATCCGAAACA TCTCGTACAG 24 0 

TGACAAGGAG gtaggacatg tgtgtact gc agatgggtcc agcagatc tt tccctgtctg 300 

cccccctcac tggagcctcc ccagccaggg catctccttg ttattcatag agtcctttaa 360 

ttcccaggct ttgagggtgt ggttgtt 387 
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NF2 Exon 9 (SEQ ID NO. 28) 

gacttggtgc tcctaattcc ctgaggttta gtgcctggat actgggaagc cagnacaagg 60 

gcataacntc atgctggtct gtggccagtg tggttgcgca tttgtggaat tnccaattgc 120 

tggtaacatt cc aggctgtc ggactgaaac t.g tqttctqc ttcattcttc cagTTTACTA 180 

TTAAACCACT GGATAAGAAA ATTGATGTCT TCAAGTTTAA CTCCTCAAAG CTTCGTGTTA 240 

ATAAGCTGGT AAGTTGAGAT CCTGgtaagt tgagatcctg gttttcatta ctgataatgg 300 

tagct-tttLCt. gagaa 315 

NF2 Exon 10 (SEQ ID No. 29) 

tgctacctgc aagagctcaa actgctatgg cactagtggg ccagt a ggcagtgaagtaaa 60 

tttgtggat a ttaacctttt tgtctgcttc tgtggccaca gATTCTCCAG CTATGTATCG 120 

GGAACCATGA TCTATTTATG AGGAGAAGGA AAGCCGATTC TTTGGAAGTT CAGCAGATGA 180 

AAGCCCAGGC CAGGGAGGAG AAGGCTAGAA AGCAGgtgag cacaaccttg ttttaa ctga 240 

tqatQtcact g-tgtggtca.g tcct.ggcct 2 69 
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CLAIMS 

1 . A method of producing a hybrid DN A molecule having a sense strand and an 
anti-sense strand and in which, reading in the 5' to 3' direction, the sense strand has 

the sequences x,, x„, where n is greater than or equal to 3, the method 

comprising the steps of 

(1) providing in a single reaction mixture 

(a) the sequences x,, X2 x^, and their complementary sequences 

X,*, X2* x„*, to be assembled into the hybrid molecule, 

(b) for each pair of complementary sequences defined in (a) a 
respective pair of PCR primers each having a priming sequence 
and which are such that the primers hybridising to the 3' ends of 
any two sequences (Xj, x'^j+i^), where i is 1 to (n-1), have 
specifically complementary linker sequences 

(2) effecting a first stage PCR reaction in which those primers provided 
with linker sequences are present in limiting concentrations, and 

(3) effecting a second stage PCR reaction using a single pair of primers 
one of which provides the 5'-end of the sense strand and other of which 
provides the 3'-end of the anti-sense strand of the required hybrid 
molecule 

whereby said hybrid molecule is generated. 

2. A method as claimed in claim 1 wherein the polymerising enzyme adds a 3* 
adenosine overhang to an extended strand and those primers incorporating linker 



wo 99/64624 ^ 



PCT/GB99/01691 



29 

sequences have their priming sequences connected to their respective Hnker sequences 
via an adenine residue. 

3. A method as claimed in claim 2 wherein the polymerising enzyme is Taq. 

4. A method as claimed in any one of claims 1 to 3 wherein the annealing 
temperature (T^,) of the linker sequences is greater than that of the priming sequences 
to the X and x* sequences. 

5. A method as claimed in claim 4 wherein the annealing temperature of the 
linker sequences is 2 to S^^C greater than that of the priming sequences to the x and x' 
sequences. 

6. A method as claimed in any one of claims 1 to 5 wherein the linker sequences 
do not have intrinsic secondary structure. 

7. A method as claimed in any one of claims 1 to 6 wherein between the first and 
second stage PCR reactions the reaction mixture is frozen to deactivate residual PGR 
activity. 

8. A method as claimed in any one of claims 1 to 6 wherein between the first and 
second stage PCR reactions the reaction mixture is treated with an exonuclease I to 
digest single stranded molecules. 

9. A method as claimed in any one of claims 1 to 8 wherein each of the first and 
second stage PCR reactions utilise a thermally activated polymerase. 

10. A method of mutation analysis wherein the analysis is effected on a DNA 
hybrid molecule produced in accordance with the method of any one of claims 1 to 9. 
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11. A set of primers incorporating the following sequences. 

5 ' tcatattagccgctgcattgcc-a-3 ' 
5'ggcaatgcagcggctaatatga-a- 3' 

5'agccactacccaaactcctgt-a-3' 
5'acaggagtttgggtagtggct-a-3* 

5'tgtctcactgaacctgcctacct— a-3' 
5'aggtaggcaggttcagtgagaca-a-3' 

5'cctcat taccggctgtcagactg-a-3' 
5'cagtctgacagccggtaatgagg-a-3' 

12. A method of producing a hybrid DNA molecule having a sense strand and an 
anti-sense strand and in which, reading in the 5* to 3' direction, the sense strand has 

the sequences x,, X2 x^, where n is greater than or equal to 3, the method 

comprising the steps of 

(i) providing in a single reaction mixture 

(a) the sequences x,, Xj x„ and their complementary sequences 

x/, X2* x„', to be assembled into the hybrid molecule, 

(b) for each pair of complementary sequences defined in (a) a 
respective pair of PGR primers each having a priming sequence 
and which are such that the primers for the 3' ends of any two 
sequences (x^, x'^j^,)), where i is 1 to (n-1), have specifically 
complementary linker sequences connected to their respective 
priming sequences via an adenine residue, and 
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1 



(2) effecting a PGR reaction using a polymerase which adds a 3' adenine 
overhang to the end of an extended strand 



wo 99/64624 



PCT/GB99/01691 



1-9 



n 



c 

X " 



m 
X - 



rs4 

X 



X 



to 



1 i 

1 X 



X 



0^ 



X 



4 



IT) -r 



ro 




\ 



X 



X 



I 



CD 



\ 

X 



SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



wo 99/64624 



PCT/GB99/01691 





100bp ladder w 

Exon 6 
Exon 7 
Exon 8 
Exon 9 
Exon 1 0 

100bp ladder o- 
Exon 8 
Exon 9 
Exon 10 

Exon 1 1 
Exon 1 2 

FIG. 3 



SUBSTITUTE SHEET (RULE 26) 



wo 99/64624 



PCT/GB99/01691 



5-9 




SUBSTITUTE SHEET (RULE 26) 



wo 99/64624 



PCT/GB99/0169I 



6-9 




SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



wo 99/64624 



1046bp 




FIG. 7 



SUBSTITUTE SHEET (RULE 26) 



